@zbeloki zbeloki commented Sep 20, 2024

When computing the F0.5 score, macro-averaging is sometimes preferable to micro-averaging. Errant currently supports only micro-averaging, which is suitable when the test set is derived from real texts. However, when the test set consists of manually crafted sentences with grammatical errors, frequently grouped by error type, it is often preferable for all error types to contribute equally to the overall F-score.

This pull request introduces a new argument, f_average, to compare_m2.py. It accepts two values: micro (default) and macro.
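
To make the difference between the two modes concrete, here is a minimal standalone sketch. It is not the code in this pull request: the function names, the per-type count layout, the example counts, and the zero-denominator convention are all assumptions made for illustration. Only the `micro` and `macro` values mirror the new argument.

```python
from typing import Dict, Tuple

# Hypothetical per-error-type counts: (true positives, false positives, false negatives).
Counts = Dict[str, Tuple[int, int, int]]


def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """F-beta from raw counts. Treating an empty denominator as a
    precision/recall of 1.0 is an assumption of this sketch."""
    p = tp / (tp + fp) if (tp + fp) else 1.0
    r = tp / (tp + fn) if (tp + fn) else 1.0
    denom = beta ** 2 * p + r
    return (1 + beta ** 2) * p * r / denom if denom else 0.0


def f_score(counts: Counts, f_average: str = "micro", beta: float = 0.5) -> float:
    """Aggregate per-type counts into a single F-beta score."""
    if not counts:
        raise ValueError("counts must not be empty")
    if f_average == "micro":
        # Pool the counts over all error types, then compute one score.
        tp = sum(c[0] for c in counts.values())
        fp = sum(c[1] for c in counts.values())
        fn = sum(c[2] for c in counts.values())
        return f_beta(tp, fp, fn, beta)
    if f_average == "macro":
        # Score each error type separately, then take the unweighted mean.
        per_type = [f_beta(tp, fp, fn, beta) for tp, fp, fn in counts.values()]
        return sum(per_type) / len(per_type)
    raise ValueError(f"unknown f_average: {f_average!r}")


# Made-up counts: one frequent error type and one rare one.
example: Counts = {"R:VERB": (8, 2, 2), "M:DET": (1, 1, 3)}
micro = f_score(example, "micro")
macro = f_score(example, "macro")
print(f"micro F0.5 = {micro:.4f}, macro F0.5 = {macro:.4f}")
```

With micro-averaging the frequent type dominates the pooled counts, while macro-averaging gives the rare type the same weight as the frequent one.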
